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Abstract. The topic of this paper are integer programming models in which 
a subset of 0/1-variables encode a partitioning of a set of objects into disjoint 
subsets. Such models can be surprisingly hard to solve by branch-and-cut 
algorithms if the order of the subsets of the partition is irrelevant, since this 
kind of symmetry unnecessarily blows up the search tree. 

We present a general tool, called orbitopal fixing, for enhancing the capa- 
bilities of branch-and-cut algorithms in solving such symmetric integer pro- 
gramming models. We devise a linear time algorithm that, applied at each 
node of the search tree, removes redundant parts of the tree produced by the 
above mentioned symmetry. The method relies on certain polyhedra, called 
orbitopes, which have been introduced in [18]. It does, however, not explicitly 
add inequalities to the model. Instead, it uses certain fixing rules for variables. 
We demonstrate the computational power of orbitopal fixing at the example 
of a graph partitioning problem. 



1. Introduction 

Being welcome in most other contexts, symmetry causes severe trouble in the so- 
lution of many integer programming (IP) models. This paper describes a method 
to enhance the capabilities of branch-and-cut algorithms with respect to handling 
symmetric models of a certain kind that frequently occurs in practice. 

We illustrate this kind of symmetry by the example of a graph partitioning 
problem (another notorious example is the vertex coloring problem). Here, one is 
given an undirected graph G = {V,E) with non-negative edge weights w £ (Qfg 
and an integer q > 2. The task is to partition V into q disjoint subsets such that 
the sum of all weights of edges connecting nodes in the same subset is minimized; 
thus, this problem is equivalent to maximizing the weights of the edges in a g-cut. 

A straight-forward IP model for this graph partitioning problem arises by in- 
troducing 0/1-variables Xij for all i E [p] := and j G [q] that indicate 
whether node i is contained in subset j (where we assume = [p]). In order to 
model the objective function, we furthermore need 0/1-variables yik, for all edges 
{j, k} £ E, indicating whether nodes i and k are contained in the same subset. This 
yields the following IP-model (see, e.g., [5]): 
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The x-variables describe a 0/1-matrix of size p x q with exactly one 1-entry per 
row. They encode the assignment of the nodes to the subsets of the partition. The 
methods that wc discuss in this paper do only rely on this structure and thus can be 
applied to many other models as well. We use the example of the graph partitioning 
problem as a prototype application and report on computational experiments for 
this appHcation in Sect. 7. 

Graph partitioning problems are discussed, for example, in [3, 4, 5]. They arise, 
for instance, as relaxations of frequency assignment problems in mobile telecom- 
munication networks, see [5]. The maximization version (min fc-cut) of the graph 
partitioning problem is relevant as well [9, 19]. Also capacity bounds on the subsets 
of the partition (which can easily be incorporated into the model) are of interest, in 
particular for the graph equipartition problem [10, 11, 29, 34]. For the closely re- 
lated clique partitioning problem see [14, 15]. Semidefinite relaxations and solution 
approaches are discussed in [6, 13]. 

As it is given above, the model is unnecessarily difficult for state-of-the-art IP 
solvers. Even solving small instances requires enormous efforts (see Sect. 7). One 
reason is that every feasible solution (x, y) to this model can be turned into q\ dif- 
ferent ones by permuting the columns of x (viewed as a 0/1-matrix) in an arbitrary 
way, thereby not changing the structure of the solution (in particular: its objective 
function value). Phrased differently, the symmetric group of all permutations of 
the set [q] operates on the solutions by permuting the columns of the .x-variables in 
such a way that the objective function remains constant along each orbit. There- 
fore, when solving the model by a branch-and-cut algorithm, basically the same 
work will be done in the tree at many places. Thus, there should be potential for 
reducing the running times significantly by exploiting this symmetry. A more sub- 
tle second point is that interior points of the convex hulls of the individual orbits 
are responsible for quite weak linear programming (LP) bounds. We will, however, 
not address this second point in this paper. 

In order to remove symmetry, the above model for the graph partitioning problem 
is often replaced by models containing only edge variables, see, e.g. [10]. For this, 
however, the underlying graph has to be complete, which might introduce many 
unnecessary variables. Moreover, formulation (1) is sometimes favorable, e.g., if 
node- weighted capacity constraints should be incorporated. 

One way to deal with symmetry is to restrict the feasible region in each of the 
orbits to a single representative, e.g., to the lexicographically maximal (with respect 
to the row- by- row ordering of the .x-components) element in the orbit. In fact, this 
can be done by adding inequalities to the model that enforce the columns of x to be 
sorted in a lexicographically decreasing way. This can be achieved by 0{pq) many 
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column inequalities. In [18] even a complete (and irredundant) linear description of 
the convex hull of all 0/1-matrices of size pxq with exactly one 1-entry per row and 
lexicographically decreasing columns is derived; a shorter proof of this completeness 
result appears in [7] . The corresponding polytopes are called partitioning orbitopes. 
A similar result can be proved for the case of packing orbitopes, in which there is 
at most one 1-entry per row. The descriptions basically consist of an exponentially 
large super class of column inequalities, called shifted column inequalities, for which 
there is a linear time separation algorithm available. We recall some of these results 
in Sect. 2. 

Incorporating the inequalities from the orbitope description into the IP model 
removes symmetry. At each node of the branch-and-cut tree this ensures that the 
corresponding IP is infeasible as soon as there is no representative in the subtree 
rooted at that node. In fact, already the column inequalities are sufScient for this 
purpose. 

In this paper, we investigate a way to utilize these inequalities (or the orbitope 
that they describe) without explicitly adding any of the inequalities to the models. 
The reason for doing this is the unpleasant effect that adding (shifted) column in- 
equalities to the models might result in more difficult LP relaxations. One way of 
avoiding the addition of these inequalities to the LPs is to derive logical implica- 
tions instead: If we are working in a branch-and-cut node at which the a;-variables 
corresponding to index subsets Iq and Ii are fixed to zero and one, respectively, 
then there might be a (shifted) column inequality yielding implications for all repre- 
sentatives in the subtree rooted at the current node. For instance, it might be (and 
this is easy to check for a given inequality) that for some {i* ^ /q U /i we have 
Xi*j* = for all 0/1-points x with Xij = £ /o) and Xij — 1 G Ii) that 

satisfy the inequality. In this case, Xi*j* can be fixed to zero for the whole subtree 
rooted at the current node, enlarging Jq. Similarly, also fixings of variables to one 
might be possible. We call the iterated process of searching for such additional 
fixings sequential fixing with (shifted) column inequalities. 

Let us mention at this point that deviating from parts of the literature, we do 
not distinguish between "fixing" and "setting" of variables in this paper. 

Sequential fixing with (shifted) column inequalities is a special case of constraint 
propagation, which is well known from constraint logic programming, see [2, 16, 27] 
for an overview. Modern IP solvers like SCIP [1] use such strategies also in the 
node preprocessing during the branch-and-cut algorithm. With orbitopes, however, 
we can aim at something better: Consider a branch-and-cut node identified by 
fixing the variables corresponding to sets /q and Ii to zero and one, respectively. 
Denote by W(/o,/i) the set of all vertices x of the orbitope with Xij = for all 
G Iq and Xij = 1 for all (i, j) G Ji. We define the sets /q and of all indices 
of variables, for which all x in W(/o,/i) satisfy Xi*j* = and Xi*j* = 1, 
respectively. We call the respective fixing of the variables corresponding to /q and 7* 
simultaneous fixing. Simultaneous fixing is always at least as strong as sequential 
fixing. 

Investigations of sequential and simultaneous fixing for orbitopes are the central 
topic of the paper. The main contributions and results are the following: 
o We present a linear time algorithm for orbitopal fixing, i.e., for solving the 
problem to compute simultaneous fixings for partitioning orbitopes (Theorem 4) 
and packing orbitopes (Corollary 1). 



4 



VOLKER KAIBEL, MATTHIAS PEINHARDT, AND MARC E. PFETSCH 



o In contrast to this, we prove that orbitopal fixing for covering orbitopes (the 
convex hulls of all lexicographically maximal 0/1-matrices with at least one 
1-entry in everj^ row) is NP-hard (Theorem 5). 

o We show that, for general 0/1-polytopes, sequential fixing, even with complete 
and irredundant linear descriptions, is weaker than simultaneous fixing (The- 
orem 2). For the case of partitioning orbitopes, we clarify the relationships 
between different versions of sequential fixing with (shifted) column inequali- 
ties, where (despite the situation for general 0/1-polytopes) the strongest one 
is as strong as orbitopal fixing (Theorem 3). 

o We report on computer experiments (Sect. 7) with the graph partitioning prob- 
lem described above, showing that orbitopal fixing leads to significant perfor- 
mance improvements for branch- and-cut algorithms. 

This paper extends the one that appeared in the proceedings of IPCO XII [17]. 
It contains the following additional material: a proof for the second part of The- 
orem 3, the above mentioned results for packing and covering orbitopes (Sect. 5), 
a comparison to the related approaches of Margot [22, 23, 25] and Linderoth et 
al. [20, 30] for the orbitope case (Sect. 6), and, finally, computational results for a 
significantly improved version of our graph partitioning code (Sect. 7). 

While our methods are based on lexicographically maximal choices of represen- 
tatives from the orbits, a more general approach admitting orderings defined by 
arbitrary Hnear functions was introduced by Friedman, see [12]. There are also a 
number of approaches for symmetry handling available from the constraint logic 
programming literature, see, e.g., [8, 31, 33]. Their general idea is similar to the 
above mentioned approaches by Margot and Linderoth et al. During the traversal 
of the tree, different techniques are used to avoid the processing of (some) symmet- 
ric parts of the tree. For an excellent survey of methods for symmetry breaking in 
integer programming we refer to [26]. 

2. Orbitopes 

Throughout the paper, let p and q be integers with p > q > 2. The partition- 
ing/packing/covering orbitope 0~ ^ is the convex hull of all 0/1-matrices 
X e {0, 1}W^M with exactly/at most/at least one 1-entry per row, whose columns 
are in non- increasing lexicographical order, i.e., they satisfy 

f^2^'-^x,, >f^2^-V,,,+i (2) 

for all j e[q-l\. 

We will mainly be concerned with partitioning orbitopes 0~,^. An exception is 
Sect. 5, in which we will show that the linear time method for orbitopal fixing of 
Sect. 4 below can easily be carried over to packing orbitopes O^^^, while there is 
no polynomial time method for orbitopal fixing for covering orbitopes O^^^, unless 
P = NP. 

Let the symmetric group of size q act on {0, 1}1''1^I*1 via permutation of the 
columns. Then the vertices of O^^ are exactly the lexicographically maximal ma- 
trices with exactly one 1-entry per row in the orbits under the symmetric group 
action; the lexicographic order is defined as in (2). 
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(a) (b) (c) (d) 



Figure 1: (a) Example for coordinates (9, 5) = (5, 5). (b), (c), (d) Three shifted column inequal- 
ities, (b) being a column inequality 



As these vertices have Xij = for all («, j) with i < j, we drop these components 
and consider O^^ as a subset of the space R-^p" with Ip^q := {(«, j) £ {0, IjlPl^I?! : 
i > j}- Thus, we consider matrices, in which the i-th row has q{i) := mm{i,q} 
components. 

The main result in [18] is a complete linear description of 0"^. In order to 
describe the result, it will be convenient to address the elements in lp,g via a 
different "system of coordinates": For j G [q] and 1 < rj < p — j + 1, define 
{r],j) '■= {j + r] — Thus (as before) i and j denote the row and the column, 

respectively, while rj is the index of the diagonal (counted from above) containing 
the respective element; see Figure 1 (a) for an example. 

A set 5 = {(1, ci), (2, C2), . . . , (77, c^)} C Ip,q with ci < C2 < ■ • • < c,, and 
77 > 1 is called a shifted column. For (i,j) = (v^j) ^ ^p.q^ ^ shifted column S 
as above with < j, and the set B = + 1), . . . , {i, q{i))} , we call 

x{B) — x{S) < a shifted column inequality. The set B is called its bar: In case of 
ci — ■ • ■ — = j — 1 the shifted column inequality is called a column inequality. 
See Figure 1 for examples. 

Finally, a bit more notation is needed. For each z £ [p], we define row^ := {(«, j) : 
j e [?(*)]}• For ^ C Xp,q and x € R-^p ', we denote by x{A) the sum J2{ij)eA ^ij- 

Theorem 1 (see [18]). The orbitope 0"^ is completely described by the non- 
negativity constraints Xij > 0, the row-sum equations x{rowi) = 1, and the shifted 
column inequalities. 

In fact, in [18] it is also shown that, up to a few exceptions, the inequalities 
in this description define facets of O"^. Furthermore, a linear time separation 
algorithm for the exponentially large class of shifted column inequalities is given. 
For a compact extended formulation of 0"^ that also leads to a simplified proof of 
Theorem 1, see [7]. 

3. The Geometry of Fixing Variables 

In this section, we deal with general 0/1-integer programs and, in particular, their 
associated polytopes. We will define some basic terminology used later in the 
special treatment of orbitopes, and we are going to shed some light on the geometric 
situation of fixing variables. 
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For some positive integer d, we denote by 

= {x eBf^ : < Xi < 1 foT all i G [d]} 

the 0/1-cube, where [d] is the corresponding set of indices of variables. For two 
disjoint subsets Iq, I\ C [rf] (hence, IqC\I\ = 0) we call 

{a; e : Xi = for all i G Iq, Xi = 1 for all i G 7i} 

the face of defined by (/q, h). All nonempty faces of are of this type. 

For a polytope P C C'' and for a face F of defined by (Jo,/i), we denote 
by FixF(-P) the smallest face of & that contains P f) F (1 {0, 1^ (i.e., FixF(-P) 
is the intersection of all faces of that contain P n F n {0, 1}''). If Fix_F(P) is 
the nonempty cube face defined by {Iq,Ii), then Jq and consist of all i G [d] 
for which Xj = and x, = 1, respectively, holds for all x G P fl F fl {0, 1}''. In 
particular, we have Iq Q Iq and Ii C /*, or FixF(P) = 0- Thus, if In and Ii 
are the indices of the variables fixed to zero and one, respectively, in the current 
branch-and-cut node (with respect to an IP with feasible points P fl {0, l}"^), the 
node can either be pruned, or the sets Iq and /* yield the maximal sets of variables 
that can be fixed to zero and one, respectively, for the whole subtree rooted at this 
node. Unless Fixi?(P) = 0, we call (/g,/*) the fixing of P at {Io,Ii). Similarly, 
we call Fixir(P) the fixing of P at F. 

Remark 1. If P. P' C C'' are two polytopes with P C P' and F and F' are two 

faces ofC^ with F C F' , then FixpiP) C Fixi./(P') holds. 

In general, it is not clear how to compute fixings efficiently. Indeed, computing 
the fixing of P at (0,0) includes deciding whether P fl {0,1}'' = 0, which, of 
course, is NP-hard in general. On the other hand, the following holds. 

Lemma 1. // one can optimize a linear function over P n {0. 1}'' in polynomial 
time, the fixing (/q,/*) at (/o,/i) can be computed in polynomial time. 

Proof. Let c G M,'^ be the objective function vector defined by 

1 if i G /i 

-1 if i G lo for all i G [d]. 

otherwise 

For each i* G [d] \ {Iq U /i) we have 

max{(c + (Bi*)^a; : X G Pn{0, 1}''} < |/i| + l 

(where cBj is the ith unit vector) if and only if i* € Iq, and 

max{(c-(Bi*)^a; : X G Pn{0, 1}'^} < |Ji| 

if and only if i* G /j* . Thus, wc can compute Iq and /* by solving 2{d— \Iq \ — |/i |) 
many linear optimization problems over P fl {0, 1}''. □ 

Note that the reverse to the imphcation stated in Lemma 1 does not hold, in 
general. This can, e.g., be seen at the example of 0/1-knapsack problems with 
P = {x G M'* : ^2^=1 "■'^'i ~ ^} (with oi, . . . ,0^ > 0). For every {Iq,Ii) the fixing 
{Iq,Ii) can be computed in linear time: 

/q = /o U {i'' ^ /o U /i : tti* + ^ aj > b}, l^ = h. 

jell 
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In contrast, the optimization problem over P D {0, l}*^ is NP-hard. 

If the linear optimization problem over P f) {0, 1}'' cannot be solved efficiently, 
one can still try to compute (hopefully large) subsets of Iq and by considering 
relaxations of P. In case of an IP that is based on an intersection with an orbitope, 
one might use the orbitope as such a relaxation. We will deal with the fixing 
problem for partitioning orbitopes in Sect. 4 (and for packing and covering orbitopes 
in Sect. 5). Since the optimization problem for partitioning and packing orbitopes 
can be solved in polynomial time (see [18]), by Lemma 1, the corresponding fixing 
problems can be solved in polynomial time as well. However, we will even describe 
linear time algorithms for these cases. 

If P is given via an inequality description, one possibility is to use the knap- 
sack relaxations obtained from single inequalities among the description. For each 
of these relaxations, the fixing can easily be computed. If the inequality system 
describing P is exponentially large, and the inequalities are only accessible via a 
separation routine, it might in some cases nevertheless be possible to decide effi- 
ciently whether any of the exponentially many knapsack relaxations allows to fix 
some variable (see Sect. 4.2). 

Suppose, P = {x G : Ax < b} and Pr = {x £ : ajx < hr} is the knapsack 
relaxation of P for the rth-row ajx < hr of Ax < b, where r = 1, . . . ,m. Let F be 
some face of C . The face G of C obtained by setting G '■— F and then iteratively 
replacing G by FixG(-Pr) as long as there is some r € [m] with FixG(^V) 5 G, is 
denoted by FixF(^a; < b). Note that the outcome of this procedure is independent 
of the choices made for r, due to Remark 1. We call the pair {Iq,I\) defining the 
cube face Fixi?(^.T < h) (unless this face is empty) the sequential fixing of Ax < b 
at (/o,/i). In the context of sequential fixing we often refer to (the computation 
of) FixF(f') as simultaneous fixing. 

Due to Remark 1, it is clear that FixpiP) Q FixF(^a; < b) holds. 

Theorem 2. In general, even for a system of facet-defining inequalities describing 
a full- dimensional 0/1-polytope, sequential fixing is weaker than simultaneous fixing. 

Proof. The following example shows this. Let P C be the four-dimensional 
polytope defined by the trivial inequalities Xi > for i G {1,2,3}, Xi < 1 for 
i € {1,2,4}, the inequality —Xi X2 + x^ — x^ < and Xi — X2 + X3 — X4 < 0. 
Let F be the cube face defined by ({4},0). Then, sequential fixing does not fix 
any further variable, although simultaneous fixing yields Iq = {3,4} (and /* — 0). 
Note that P has only 0/1-vertices, and all inequalities are facet defining (0:4 > 
and X3 < 1 are implied). □ 

4. Fixing Variables for Partitioning Orbitopes 

For this section, suppose that Io,Ii C Ip g are subsets of indices of partitioning 
orbitope variables with the following properties: 
(PI) |/o n rowil < q{i) - 1 for all i G \p]. 

(P2) For all G 7i, we have (ij) G Iq for all £ G [q{i)] \ {j}. 
In particular, PI and P2 imply that /o O/i = 0. Let F be the face of the 0/1-cube 
C^"'" defined by (/o,/i)- Note that if PI is not fulfilled, then OJ^ nP = 0. The 
following statement follows immediately from Property P2. 

Remark 2. If a vertex x of 0~g satisfies Xij = for all G Iq, then x G F. 
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(a) (b) 



Figure 2: (a): Example for Remark 4. Dark-gray entries indicate entries (i,Oi(/o)) and light- 
gray entries indicate the entries in Si{Io) for i = 6. 

(b): Example for Lemma 3. As before, dark-gray entries indicate entries (i, cti). Light-gray entries 
indicate entries (j,/^i(/o))- The *'s indicate Is set in the point x* as constructed in Lemma 3. 



We assume that the face Fbip{0~^) is defined by {Iq,Ii), if Fixp{0~g) is not 
empty. Orbitopal fixing (for partitioning orbitopes) is the problem to compute the 
simultaneous fixing (/g,/*) from (/o,/i), or determine that Fixi?(0~^) — 0. 

Remark 3. //Fixi?(0~^) 7^ 0, it is enough to determine Iq, as we have £ 
if and only if {i, i) e Iq holds for for all £ € [q{i)] \ {j}. 

4.1. Intersection of Partitioning Orbitopes with Cube Faces 

We start by deriving some structural results on partitioning orbitopes that are 
crucial in our context. Since 0~,j C C"^'" ' is a 0/1-polytope (i.e., it is integral), we 
have conv(Oj^^ nF n {0, 1}-^"") — O^ ^ (IF. Thus, Fix_F(0^^) is the smallest cube 
face that contains the face O^^ CiF of the orbitope O^^. 

Let us, for i € [p], define values Ui Q!i(/o) £ [<?(*)] recursively by setting ai := 1 
and, for all i £ [p] with i > 2, 

J ai^i if tti-i = q{i) or {i, a^^i + I) e Iq 

Oil ■— s 

I tti-i -I- 1 otherwise. 

The set of all indices of rows, in which the a-value increases, is denoted by 

r(/o) {ie[p] : i> 2, a. = a^_i -I- 1} U {1} 

(where, for technical reasons, 1 is included). 

The following observation follows readily from the definitions. 

Remark 4. For each i £ [p] with i > 2 and ai{Io) < the set Si{If)) := 

{{k, ak{Io) + 1) ■ k € \ r(/o)} is a shifted column with Si{Io) C Iq. 

Figure 2 (a) shows an example. 

Lemma 2. For each i £ [p], no vertex of 0~g DF has its 1- entry in row i in a 
column j £ [q{i)] with j > ai{Io)- 

Proof. Let i £ [p]. We may assume Q;i(/o) < 9(1), because otherwise the statement 
is trivially true. Thus, B := £ row^ : j > ai{Io)} ^ 0. 

Let us first consider the case i £ r(/o). As we have Q;i(/o) < q{i) < i and 
ai(Jo) = 1, there must be some k < i such that k ^ r(/o). Let k be maximal 
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with this property. Thus, we have k' G r(/o) for all 1 < A; < fc' < i. According to 
Remark 4, x{B) — x{Sk{Io)) < is a shifted column inequality with x{Sk{Io)) = 0, 
showing x{B) = as claimed in the lemma. 

Thus, let us suppose i G [p] \r(/o). If ai(/o) > — 1, the claim holds trivially. 
Otherwise, B' := B \ {(i,ai(/o) + 1)} 7^ 0. Similarly to the first case, now the 
shifted column inequality x{B') — a;(5'i_i(/o)) < proves the claim. □ 

For each i e [p], we define Hi{Io) '■= niin{j G [q{i)] ■ ^lo}- Because of 

Property PI, the sets over which we take minima here are non-empty. 

Lemma 3. If we have /tti(/o) ^ Ci-i{Io) for all z G [p], then the point x* = x*{Io) G 
{0, l}^"-" defined by a;*„^(j^) = 1 for all i G r(/o), = 1 for all i G b]\r(/o), 

and all other components being zero, is contained in 0~ ^ C\F . 

Proof. Due to ai(/o) < ai-i(/o) + 1 for all i G [p] with z > 2, the point x* is 
contained in OJ^^. It follows from the definitions that x* does not have a 1-entry 
at a position in /q. Thus, by Remark 2, we have x* G F. □ 

Figure 2 (b) shows an example for the point constructed in Lemma 3. 

We now characterize the case 0"^ OF = (leading to pruning the corresponding 
node in the branch-and-cut tree) and describe the set Iq. 

Proposition 1. 

(1) We have O"^ OF = if and only if there exists i G [p] with /ii(io) > cci{Io)- 

(2) If fii{Ia) < Q!i(/o) holds for all i G [p], then the following is true. 

(a) For all i G [p] \ r(/o), we have 

Iq nrow, = G row, : (i, j) G Iq or j > a,(/o)}. 

(b) For all i € [p] with /L(i(/o) = Q;i(/o), we have 

Iq n rowi = rowj Q!j(/o))}. 

(c) For all s G r(/o) with iJ,a{Io) < <^s{Io) the following holds: If there is 
some i> s with /ii(/o) > cxi{Io U {(s, as(/o))}), then we have 

/q n roWs = rowg \{(s, as{Io))}. 

Otherwise, we have 

/gnrows = {(s,j) GroWs : (s,j)G/o or j > as{Io)}- 

Proof. Part 1 follows from Lemmas 2 and 3 (see also Figure 3 (a)). 

In order to prove Part 2, let us assume that Hi{Io) < Q!i(-fo) holds for all i G [p]. 
For Part 2a, let i E [p] \ r(/o) and G rowj. Due to Iq C /q, we only have to 
consider the case ^ Iq- If j > cti{Io), then, by Lemma 2, we find G Iq. 
Otherwise, the point that is obtained from x*(/o) (see Lemma 3) by moving the 
1-entry in position (i,/i,(7o)) to position is contained in OJ ^ flF, proving 

In the situation of Part 2b, the claim follows from Lemma 2 and O^ ^ (iF ^ 
(due to Part 1). 

For Part 2c, let s G r(/o) with /Xs(/o) < ois{Io) and define Iq := /oU{(s, as(/o))}. 
It follows that we have /Ui(/o) = Mj(-^o) for all i G [p]; compare also Figure 3 (b). 

Let us first consider the case that there is some i > s with /ii(/o) > cti (7q). Part 1 
(applied to Iq instead of Iq) implies that 0~g CiF does not contain a vertex x 
with Xg^asiio) ~ 0- Therefore, we have {s,as{Io)) G I*, and thus Iq fl rowg = 
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(a) (b) (c) (d) 

Figure 3: (a): Example for Proposition 1 (1). Light-gray entries indicate the entries (i,iii{Io)) 
and dark-gray entries indicate entries (i,ai(/o))- (b): Example of fixing an entry to 1 for Propo- 
sition 1 (2c). As before light-gray entries indicate entries (i,/ii(/o))- Dark-gray entries indicate 
entries (i, «i(/o U {{s, as{Io))})) with s = 3. 

(c) and (d): Gray entries show the SCIs used in the proofs of Parts 1(a) and 1(b) of Theorem 3, 
respectively. 



rowg \{(s, as(/o))} holds (where for "C" we exploit Op^CiF ^ by Part 1, this 
time applied to Iq). 

The other case of Part 2c follows from s ^ ^(Iq) and as{Io) = as{Io) — 1- 
Thus, Part 2a appHed to Iq and s instead of Iq and i, respectively, yields the claim 
(because of (s, as(/o)) ^ Iq due to s e r(/o) and ,j nF ^ 0). □ 



4.2. Sequential Fixing for Partitioning Orbitopes 

Let us, for some fixed p > q > 2, denote by iSsci the system of the non-negativity 
inequalities, the row-sum equations (each one written as two inequalities, in order 
to be formally correct) and all shifted column inequalities. Thus, according to 
Theorem 1, O^^ is the set of all x S R-^p" that satisfy 5sci- Let Sqi be the 
subsystem of 5sci containing only the column inequalities (and all non-negativity 
inequalities and row-sum equations). 

At first sight, it is not clear whether sequential fixing with the exponentially 
large system 5sci can be done efficiently. A closer look at the problem reveals, 
however, that one can utilize the linear time separation algorithm for shifted column 
inequalities (mentioned in Sect. 2) in order to devise an algorithm for this sequential 
fixing, whose running time is bounded by 0{gpq), where g is the number of variables 
that are fixed by the procedure. 

In fact, one can achieve more: One can compute sequential fixings with respect 
to the affine hull of the partitioning orbitope. In order to explain this, consider 
a polytope P = {x e : Ax < 6}, and let S" C R'' be some affine subspace 
containing P. As before, we denote the knapsack relaxations of P obtained from 
Ax < 6 by Pi, Pm. Let us define Fixf.(Pr) as the smallest cube face that 
contains Pr n fl {0, 1}'' n F. Similarly to the definition of FixF{Ax < b), denote 
by Fijip{Ax < b) the face of that is obtained by setting G ■— F and then itera- 
tively replacing G by FixQ(Pr) as long as there is some r G [m] with FixQ(Pr) C G. 
We call Fixp{Ax < b) the sequential fixing of Ax < b at F relative to S. Obviously, 



ORBITOPAL FIXING 



11 



we have Fixi?(P) C Fix|.(^a; <b)C FixpiAx < b). In contrast to sequential fix- 
ing, sequential fixing relative to affine subspaces in general is NP-hard (as it can 
be used to decide whether a linear equation has a 0/1-solution). 

Theorem 3. 

(1) There are cube faces , F"^ , F^ with the following properties: 

(a) Fixp-i(5sc/) C Fixp-i(5c/) 

(b) Fix^'^'°-'Vc/) C Fh,F.{Ssci) 

, . aff(0-„) aff(0-„) 

(c) Fix^3 (SsGi) C Fix^3 {Sci) 

(2) For all cube faces F, we have F\Xp^'^'''''\Ssci) = FixF(Op,g)- 

Proof. For Part (la), we chose p = 5, g = 4, and define the cube face F^ via 
/(I = {(3. 2), (5.1), (5. 2), (5. 3)} and 1} = {(1, 1), (5, 4)}. The shifted column in- 
equality with shifted column {(2,2), (3,2)} and bar {(5,4)} allows to fix 2:22 to 1 
(see Fig. 3 (c)), while no column inequality (and no non-negativity constraint and 
no row-sum equation) allows to fix any variable. 

For Part (lb), let p = 4, g = 4, and define F'^ via = {(3, 2), (4, 1), (4, 2)} and 
If = {(1, 1)}. Exploiting that 0:43 + X44 ~ 1 for all x G aff(Op^) fl F^, we can use 
the column inequality with column {(2, 2), (3, 2)} and bar {(4, 3), (4, 4)} to fix 2:22 
to one (see Fig. 3 (d)), while no fixing is possible with .Ssci only. 

For Part (Ic), we can use F^ = F^. 

In order to prove Part (2), we have to show for every cube face F that 

Fixf^°-'^(5sci)CFix^(0=,) 

holds. We use the notation introduced in Sect. 4. The crucial fact is that ev- 
ery point .T 6 F n a.S{Op g) satisfies x{B) = 1 for every B C row, such that 
{{l,H,iIo)),...,{l,q{l))}CB. 

Let us first consider the case FixF(0~q) = 0. Due to Part 1 of Prop. 1 there is 
some i € [p] with /ii(/o) > cti{Io). Therefore, the SCI x{B)~x{S) < constructed in 
the proof of Lemma 2 with = (z,/Xj(/o)) has x{B) = 1 for all x G Fnaff(Opg), 
but x{S) = due to S C Iq. This shows that we indeed have 

Fixf^°'^''^(5sci) = 

in this case. 

Otherwise (i.e., FixF(Op^) 7^ 0), it suffices to show for each {k,£) £ Iq\Io that 
there is some SCI that can only be satisfied by some .t G F fl a,S{Op g) if Xke = 
holds. Due to Part 2 of Prop. 1, we have to consider two cases. 

Case 1: We have I > ak{Io)- Then the SCI x{B) — x{S) < constructed in the 
proof of Lemma 2 with = {k,£) implies x{B) = for all x € F (because of 
S Q Iq), which yields Xke = due to {kj.) G B. 

Case 2: We have k G r(/o) with (. < ak{h) and there is some r >k with iir{h) > 
ar{Io U {(fc, afe(/o))}). Then the SCI x{B) — x{S) < constructed in the proof of 
Lemma 2 with = (r, /ir(/o)) (and Iq replaced by Iq U {{k,ak{Io))}) satisfies, 
for each a: G F n aff(Op ,^), x{B) = 1 and x{S) = x^^. (due to C /q U 

{(fc, afe(Jo))}), which implies X(^k,akiio)) ~ ^' ^^"^ hence (as x G F n afl[(0~^)) 
a;/j£ = 0, because of ^ ^ Q!fc(/o)- D 
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Algorithm 1 Orbitopal Fixing 

1: Set Iq ^ lo, /J' <— i"i , /ii <— 1, ai <— 1, and T = 0. 

2: for i = 2, . . . ,p do 

3: compute m <— min{j : ^ lo}- 

4: if ai-i = q{i) or (i, ai-i + 1) G /o then 

5: ai <— ai-i 

6: else 

7: tti ^ +1, r ^ ru{i} 

8: if fii > O-i then 

9: return "Orbitopal fixing is empty" 

10: Set II U{(j,i) : j > qJ. 

11: if \ll n rowj \ = g(i) - 1 then 

12: set 7J ^ Jj* U (rowi\/^). 

13: for all s e r with {s,as) ^ do 

14: Set /3s ^ as - 1. 

15: for i = s + 1, . . . ,p do 

16: if Pi-i = q(i) or + 1) £ lo then 

17: ft Pi-i 

18: else 

19: ft^ft_i + l 

20: if Hi > Pi then 

21: q *- qu {{s, as)} and ^ rows \{(s, a^)}. 

22: Proceed with the next s in Step 13. 



The different versions of sequential fixing for partitioning orbitopes are domi- 
nated by each other in the following sequence: 

Sci {<Ssci, afiine Sci} affine <Ssci, 

which finally is as strong as orbitopal fixing. For each of the arrows there exists an 
instance for which dominance is strict. The examples in the proof of Theorem 3 
also show that there is no general relation between iSsci and affine Sci- 

In particular, we could compute orbitopal fixings by the polynomial time algo- 
rithm for sequential fixing relative to aff(0~g). It turns out, however, that this is 
not the preferable choice. In fact, we will describe below a finear time algorithm 
for solving the orbitopal fixing problem directly. 

4.3. An Algorithm for Orbitopal Fixing 

Algorithm 1 describes a method to compute the simultaneous fixing (/q,/^^) from 
{Io,Ii) (which are assumed to satisfy Properties PI and P2). Note that we use Pi 
for ai{Io U {{s,as{Io))})- 

Theorem 4. The orbitopal fixing problem for partitioning orbitopes can be solved 

in time 0{pq) (by a slight modification of Algorithm 1). 

Proof. The correctness of the algorithm follows from the structural results given in 
Proposition 1. 

In order to prove the statement on the running time, lot us assume that the data 
structures for the sets /q, /i, Iq, and allow both membership testing and addition 
of single elements in constant time (e.g., the sets can be stored as bit vectors). 

As none of the Steps 3 to 12 needs more time than 0(g), we only have to take 
care of the second part of the algorithm starting in Step 13. (In fact, used verbatim 
as described above, the algorithm might need time 0(p^).) 
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For s,s' e T with s < s' denote the corresponding /^-values by Pi {i > s) and 
by Pi {i > s'), respectively. We have /3, < PI for all i > s' , and furthermore, if 
equality holds for one of these i, we can deduce Pk = P'f. for all k > i. Thus, as 
soon as a pair (i, Pi) is used a second time in Step 20, we can break the for-loop in 
Step 15 and reuse the information that we have obtained earlier. 

This can, for instance, be organized by introducing, for each G Tp.q, a flag 
f{i,j) € {red, green, white} (initialized by white), where f{i,j) = red / green means 
that we have already detected that Pi = j eventually leads to a positive/negative 
test in Step 20. The modifications that have to be applied to the second part of the 
algorithm are the following: The selection of the elements in T in Step 13 must be 
done in increasing order. Before performing the test in Step 20, we have to check 
whether f{i,Pi) is green. If this is true, then we can proceed with the next s in 
Step 13, after setting all flags f{k,Pk) to green for s < A; < i. Similarly, we set all 
flags f{k,Pk) to red for s < k < i, before switching to the next s in Step 22. And 
flnally, we set all flags f{k, Pk) to green for s < k < p at the end of the body of the 
s-loop starting in Step 13. 

As the running time of this part of the algorithm is proportional to the number of 
flags changed from white to red or green, the total running time indeed is bounded 
by 0{pq) (since a flag is never reset). □ 

5. Fixing for Packing and Covering Orbitopes 

The packing orbitope O^^ obviously can be obtained from the partitioning or- 
bitope 0~_^_i g_^_i by projecting out the first column and row, i.e., by orthogonal 
projection to the coordinate subspace associated with 

{(«, j) e lp+i,q+i ■■ i,j > 1} 

(and renaming the variables appropriately), see also [18]. 

In general, for J C [d], the orthogonal projection tt : ]R}'^ — > R"^, a polytope 
P C C'', and some face F of the cube C'' = [0, l]'', we have 

Fix,(p)(F)=7r(Fixp(7r-i(P))), 

since for every face G of C"^ 

(7r(P) n F) C G <^ (P n n-\F)) C ■k-\G) 

holds (simply because taking preimages commutes with taking intersections). 
Thus, the following result for packing orbitopes follows readily from Theorem 4. 

Corollary 1. Variable fixing for packing orbitopes O^^ can be done in time 0{pq) 
by reduction to orbitopal fixing for 0~_^_l q_^_l . 

In contrast to this, variable fixing for covering orbitopes O^g cannot be done in 
polynomial time, unless P = NP, as the following result impHes. 

Theorem 5. The problem to decide whether, for given io C [p] x [q], the covering 
orbitope 0-^ contains a vertex x* S O-^ with x*j = for all G lo, is NP- 

complete. 

Proof. It suffices to show that one can construct, for each graph G = {V,E) and 
fc G N with k < \V\, (in time bounded polynomially in |y|) an instance of the 
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Figure 4: Example for the construction in the proof of Theorem 5. Let G be the depicted graph, 
and let k = 7, i.e., k = [log2(A; + 1)] =3 and k = 7. Thus, no new edge is needed, and we have 
G = G. In the matrix, Os correspond to elements of Iq, Is have been set in the construction of a 
feasible solution, and empty entries correspond to Os set in the construction. The top right shows 
the binary tree used for the construction with ai, . . . , 07 = 2, 6, 10, 14, 18, 22, 26. 



decision problem described in the theorem whose answer is "yes" if and only if G 
has a vertex cover of size at most k. 

Towards this end, let k := [log2(A; + 1)] be the smallest integer such that we 
have fc := 2" — 1 > fc. Construct a graph G = {V,E) by adding k — k new edges 
(forming a matching) on 2{k — k) nodes disjoint from V . Thus G is a graph with 
I V"! = \ V\+2{k — k) nodes and m '■= \E\ = \E\+ k — k edges that has a vertex cover 
of size at most k if and only if G has a vertex cover of size at most k. 

For the instance of the decision problem described in the theorem, let p := K+m, 
q :— 2\V\, and assume V = {2, 4, 6, ... , q}. Numbering the edges of G by 

E = {ei,e2, . . . ,6^} with eh = {vh, Wh} C V for all h G [m], 

we set 

^0 •= {{k + h,j) : he [m], j € [q] \ {vh,Wh}}. 

See Figure 4 for an example. 

In order to prove that the answer to the constructed instance is "yes" if and only 
if G has a vertex cover of size at most k, let us call, for x* G {0, IjHxM^ a pair (i, j) 
an alibi (for column j of a;*), if a;*j_i — 1 and a;*- = hold. 

If X* G O^^ is a vertex of the covering orbitope O^^ (i.e., a 0/1-point in the 
orbitope) with x*j = for all € /q, then 

C := {v &V : a;*_|_/j ^ = 1 for some h G [m]} 

is a vertex cover in G (due to x*(TOWK+h) > 1 for all h G [m]). Moreover, for 
every v G C, there is an alibi {i,v) in some row i G [k], since column v of x* is 
lexicographically not larger than column v — 1. Again due to the lexicographical 
ordering of the columns, every vertex of the orbitope can have at most 2'~^ ahbis 
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in row i. It follows that 

K K—1 

\C\ < 2*-i = ^ 2^ = 2-* - 1 = fc. 

j=l i=0 

Conversely, suppose C C y is a vertex cover with |C| < k. We construct a 0/1- 
point X* e O^g with x*j = for all e /o as follows. First, for each h G [m], 
we set 

x^+h,vh ■ |q otherwise ^«+?»,«';. • otherwise. 

Since C is a vertex cover of G, the part of x* that we have already constructed 
has at least one 1-entry in every row k+1, . . . ,p. It thus remains to construct the 
first K rows such that they contain an alibi for every column v with v € C (and such 
that each of these rows contains at least one 1-entry). This can, e.g., be achieved 
as follows. 

First, choose an arbitrary sequence {ai, . . . ,a^.) (of length A: = 2" — 1) of numbers 
in V" = {2, 4, 6, ... , q} with C C {ai, . . . , aj,} (which is possible due to |C| < A;). 

Then a complete rooted binary tree (embedded into the plane) of height k (having 
2" — 1 = nodes) is constructed in which the nodes receive pairwise different labels 
1, 2, . . . , A;. Furthermore, the labels have to be assigned in such a way that for every 
node labeled t, we have ae < at for all labels £ in the left subtree and a* < Or for 
all labels r in the right subtree rooted at t. 

Then we complete x* to a vertex of O^^ by putting in each row i € [k] alibis at 
all positions {i,at) for t running through all labels of nodes at distance i — 1 from 
the root of the tree and filling the remaining components of x* accordingly. □ 

Of course. Theorem 5 implies that optimization over covering orbitopes is NP- 
hard. In particular, in contrast to the packing and partitioning orbitopes, we cannot 
expect to find a tractable linear description of O^^, unless NP = coNP. 

In fact, using ideas of the proof of Theorem 5, one can also estabHsh other similar 
statements, in which O^^ is replaced by the convex hull of all 0/1-matrices (whose 
columns are in lexicocraphically non-increasing order) with at least k one-entries 
per row for each fc > 1, or with exactly k one-entries per row for each A; > 2. 

6. Comparison with Isomorphism Pruning and Orbital Branching 

In [22, 23] Margot developed a related, but more general approach to symmetry 
breaking, called isomorphism pruning. The main components are a setting rule for 
variables and a pruning rule for nodes in a branch-and-bound tree to avoid consid- 
eration of equivalent (partial) solutions. In this section we outline the differences 
and similarities between Margot's and our approach when specialized to the type 
of symmetries addressed by partitioning orbitopes. 

Isomorphism pruning deals with arbitrary symmetries in any binary program 
minjc^a; : Ax < b,x € {0, 1}''}, or even integer program [23]. Let G be a group 
of permutations of the variables (inducing permutations of the components of c, 
the columns of A, and the set of feasible solutions) such that for every g € G we 
have g{c) = c and g{A) = <Tg{A), g[h) = (Jg{h) for some permutation (jg of the rows 
of A resp. components of b. In particular, G acts on the set of feasible solutions 
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(via coordinate permutations) with the property that the objective function cF x is 
constant on every orbit. Given an order of the variables (a rank vector R), Mar- 
got 's approach then assures that only (partial) solutions that are lexicographically 
minimal in their orbit under G are explored in the branch-and-bound tree. More 
precisely, a partial solution in some branch-and-bound node N is identified with 
the sets and Iq of variables that have been fixed to 1 and 0, respectively, in 
the path from the root to A''. A partial solution is lex-min in its orbit, i.e., it is a 
representative, if the set is lexicographically minimal with respect to the rank 
vector R. Thus, N can be pruned if it is not a representative. 

Note that the definition of the lexicographical order relies upon a particular 
total order of the variables defined by the rank vector R. It should be mentioned 
that in [23] this limitation was relaxed by using an arbitrary order that can be 
determined during the branch-and-bound process. Still, the same rank vector has 
to be used throughout the branch-and-bound tree. Essentially, this means that 
whenever a branching is to be performed at some level of the branch-and-bound 
tree for the first time, the branching variable for this level can be freely chosen, 
and the rank vector is extended. In [26], the restriction of a global rank vector is 
dropped, too. The rank vector at some node A'' of the branch-and-bound tree is 
now given by the branching decisions from the root node to N. 

At every node A'' of the branch-and-bound tree, two 0-setting operations are 
performed. If N was created by fixing the variable to then all variables in the 
(sub)orbit of a;/ under the stabilizer of if^ are set to 0, too, since a 1 for any of these 
variables would lead to a partial solution lex-greater than /f . Furthermore, in case 
the next branching variable is known (e.g., if a global rank vector is available), it 
is repeatedly tested whether any representative can be reached from N by checking 
whether the current representative together with is also a representative, i.e., 
whether Ij^ U {x^} is lexicographically minimal under G. 

For a comparison with orbitopal fixing, we consider isomorphism pruning spe- 
cialized for partitioning problems with 0/1-varaibles Xij satisfying J2j ^ij = 1 fo'" 
all i, where the symmetry group G is assumed to be the group of all permutations 
of the columns of the variable matrix x. Note that such problems do not require 
the elaborate machinery of group theoretic algorithms developed by Margot for the 
general case of more complicated symmetry groups. 

We assume that the canonical rank vector R is used, i.e., the one that describes 
the row- wise ordering of the variables Xij . For this ordering, the representatives 
used by isomorphism pruning are in one-to-one correspondence with faces of the 
cube [0, 1]-^P'« having nonempty intersections with 0~^. 

The row-wise ordering of the variables is natural choice. It moreover turned 
out from our computational experiments with graph partitioning problems that 
we could not find any alternative ordering yielding better results for isomorphism 
pruning, at least for this application. This even holds true for all variants we tested 
without a global rank vector (see Sect. 7). 

If branch-and-bound trees for both methods are obtained by minimum index 
branching, orbitopal fixing can be well compared to isomorphism pruning. In- 
deed (provided that the nodes are also processed in the same order and no cutting 
planes are added), orbitopal fixing will visit only branch-and-bound nodes that 
isomorphism pruning visits as well. In any of these nodes the orbitopal fixing al- 
gorithm does not perform 1-fixings in loop 13 in Algorithm 1. However, it may do 
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Figure 5: Example of a branch-and-bound node in wiiicii tiie variables in the first five rows 
have been fixed by mimimum index branching. The zeroes below the fifth row have been set by 
orbitopal fixing after processing the current node, but not by isomorphism pruning. 



more zero-fixings in loop 2 than the ones that isomorphism pruning does. For an 
example see Figure 5. Thus, the main advantage of orbitopal fixing for our special 
case of symmetry can be seen as deriving as early as possible conclusions that hold 
at every child node. 

If other branching rules than minimum index branching are applied, the trees 
produced by using orbitopal fixing and isomorphism pruning are not comparable, in 
the sense that, in general, both variants will visit branch-and-bound nodes (partial 
solutions) not visited by the other one. But even at nodes that appear in both 
trees the behaviour of the two methods cannot really be compared unless both 
use the same representatives. This, however, is only the case if the isomorphism 
pruning variant does minimum index branching, yielding (locally) the canonical 
rank vector. Hence, we are in the situation already discussed above (no matter by 
which branching rule the orbitopal fixing variant has arrived at the node). 

Another approach for avoiding symmetrical solutions is orbital branching [20]. 
It handles the same type of symmetry as isomorphism pruning, but in a local 
manner: the symmetry group of the current LP is computed on-the-fly for every 
node of the branch-and-bound tree after removing fixed variables and inequalities 
that are satisfied regardless of the unfixed variables. The set of unfixed variables 
then decomposes into orbits of equivalent variables under the symmetry group of 
the current LP. Then for some orbit of equivalent variables a two-way branching 
is done, where in one branch the case is considered that all variables in the orbit 
are zero, and in the other branch a chosen variable from the orbit is fixed to one. 
Orbital branching is thus not comparable to either orbitopal fixing nor isomorphism 
pruning, as it can consider symmetries that only arise after fixing variables. On the 
other hand a problem usually looses global symmetry when variables are removed. 
As the authors of [20] already point out, it is quite time consuming to compute 
the symmetry group for every branch-and-bound node. In [21] the authors report 
that a variant of orbital branching exploiting global symmetry is computationally 
superior, i.e., instead of computing the symmetry group of every local LP in the 
branch-and-bound tree, the global symmetry group of the root LP is used. In this 
variant, the symmetry group used in a branch-and-bound node is the subgroup 
of the global symmetry group that setwise stabilizes the variables already fixed 
to one. This, however, is very similar to isomorphism pruning. If there is some 
orbit O of equivalent variables at some node N, then orbital branching on O is the 
same as pruning by isomorphism and 0-setting relative to the branching decisions 
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leading to N. In this sense orbital branching with global symmetry can be seen as 
isomorphism pruning with local rank vectors. 

In the case of partitioning orbitopes, the symmetry has very simple structure. 
Note that any symmetry considered here stems from the permutation of columns of 
a matrix variable, which is a reasonable restriction in particular with regard to the 
example application presented in our graph partitioning formulation (1). At every 
branch-and-bound node, the columns of the matrix variable x decompose into one 
set of columns that are fixed elementwise, and one set of columns which still can 
be permuted arbitrarily. 

The special structure of partitioning problems considered here implies that the 
orbits described above are the same in both variants of orbital branching, at least 
as long as symmetry among rows of the matrix variable (graph automorphisms in 
an instance of graph partitioning) are ignored. 

The main advantage of orbital branching lies within its flexibility, e.g., orbital 
branching can be used at any node regardless how branching was performed on 
other nodes of the branch-and-bound tree. 

7. Computational Experiments 

We performed computational experiments for the graph partitioning problem men- 
tioned in the introduction. The code is based on the SCIP 1.1.0 branch-and-cut 
framework [32], originally developed by Achterberg [1]. We use CPLEX 11.00 as the 
underlying LP solver. The computations were performed on a 3.2 GHz Pentium 4 
machine with 4 GB of main memory and 2 MB cache running Linux. All compu- 
tation times are CPU seconds and are subject to a time limit of four hours. Since 
in this paper we are not interested in the performance of heuristics, we initialized 
all computations with the optimal primal solution. 

We compare different variants of the code by counting winning instances. An 
instance is a winner for variant A compared to variant B, if A finished within the 
time limit and B did not finish or needed a larger CPU time; if A did not finish, then 
the instance is a winner for A in case that B did also not finish, leaving, however, a 
larger gap than A. If the difference between the times or gaps are below 1 sec. and 
0.1 %, respectively, the instance is not counted. 

In all variants, we fix the variables Xij with j > i to zero. Furthermore, we 
heuristically separate general clique inequalities 

where 

b=lt{t-l){q-r) + ^t{t + l)r 

and C C 1/ is a clique of size tq + r > q with integers t>l,0<r<q (see [3]). 
The separation heuristic for a fractional point y* follows ideas of Eiscnblatter [5]. 
We generate the graph G' = (V, E') with {i, k} € E' if and only if {i, k) & E and 
Vik ^ b{h+ l)/2, where y* is the y-part of an LP solution. We search for maximum 
cliques in G" with the branch-and-bound method implemented in SCIP (with a 
branch-and-bound node limit of 10 000), as well as with simple tabu search and 
greedy strategies. We then check whether the corresponding inequality is violated. 
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Table 1: Results of the branch-and-cut algorithm. All entries are rounded averages over three 
instances. CPU times are given in seconds. 
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184 
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40 


540 
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32347 


5548 


5750 


40 


540 


9 


62053 
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162871 


7182 


43434 


4768 


25709 


40 


540 


12 


40598 


6018 


2187 


69 


174 


31 


166 


40 


720 


3 


488 


1399 


393 


1325 


366 


1080 


5 


40 


720 


6 


6888 


11139 


3507 


3957 


2563 


3263 


756 


40 


720 


9 


21746 


14400 


20743 


8220 


12753 


6820 


10656 


40 


720 


12 


24739 


14400 


68920 


9164 


21067 


6209 


23532 



We also separate triangle inequalities and both kinds of cycle inequalities as given 
in [3]. 

After extensive testing, wc decided to branch by default on the first index, i.e., 
wo branch on the first fractional a;-variable in the row-wise variable order used for 
defining orbitopes. A side-efi'ect of this choice is that this branching rule makes 
orbitopal fixing more comparable to isomorphism pruning, in particular, to the 
variant using an a priori fixed variable order. It should be noted that this branch- 
ing rule is superior only when the vertices are permuted (i.e., the rows of x are 
permuted) as follows: sort the vertices in descending order of their star weight, i.e., 
the sum of the weights of incident edges. 

We generated 36 random instances with p = 40 vertices and m edges of the 
following types. We used m = 360 (sparse), 540 (medium), and 720 (dense). 
For each type we generated three instances by picking edges uniformly at random 
(without recourse) until the specified number of edges is reached. The edge weights 
are drawn independently and uniformly at random from [1000]. For each instance 
we computed results for q= 3, 6, 9, and 12. 

In a first experiment we tested the speedup that can be obtained by performing 
orbitopal fixing. For this we compare the variant (basic) without symmetry break- 
ing (except for the zero-fixing of the upper right x-variables) and the version in 
which we use orbitopal fixing (OF); see Table 1 for the results. Columns nsub give 



100% 
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2h 













III 
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Figure 6: Computation times/gaps for the basic version (dark gray) and the version with or- 
bitopal fixing (light gray). Left: instances with n = 40, m = 540. Right: instances for n = 40, 
m = 720. The number of partitions q is indicated on the x-axis. Values above 4 hours indicate 
the gap in percent. 
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the number of nodes in the branch-and-bound tree. The results show that orbitopal 
fixing is clearly superior (OF winners: 30, basic winners: 0), see also Figure 6. 

Table 1 shows that the sparse instances are extremely easy, the instances with 
m = 540 are quite easy, while the dense instances are hard. A situation that often 
occurs for small m and large q is that the optimal solution is 0, and hence no work 
has to be done. For m = 540, the hardest instances arise when g = 9. It seems 
that for g = 3 the small number of variables helps, while for q = 12 the small 
objective function values help. Of course, symmetry breaking methods become 
more important when q gets larger. 

In a second experiment, we investigated the symmetry breaking capabilities built 
into CPLEX. We suspect that it breaks symmetry within the tree, but no detailed 
information was available. We ran CPLEX 11.00 on the IP formulation stated 
in Sect. 1. In one variant, we fixed variables Xij with j > i to zero, but turned 
symmetry breaking off. In a second variant, we turned symmetry breaking on 
and did not fix variables to zero (otherwise CPLEX seems not to recognize the 
symmetry). The symmetry breaking variant turned out to be effective: it was 
always faster than the other version. The variant with no symmetry breaking and 
no fixing of variables performed extremely badly. The results obtained by the OF- 
variant above are clearly superior to the best CPLEX results (CPLEX could not 
solve 21 instances within the time limit, while OF could not solve 1). This result 
can be explained, at least partially, by the separation of clique and other problem 
specific inequalities. 

In another experiment, we turned off orbitopal fixing and separated shifted col- 
umn inequalities in every node of the tree. The results are that the OF-version is 
slightly better than the SCI variant (OF winners: 19, SCI winners: 11), but the 
results are quite close (OF average time: 2330 seconds, SCI average time: 2288 
seconds). Although by Part 2 of Theorem 3, orbitopal fixing is as strong as fixing 
with SCIs (with the same branching decisions), the LPs get harder and the process 
slows down a bit. On the other hand, the SCIs are already active in the root node, 
which yields a better root bound. This may result in fewer branch-and-bound nodes 
due to potentially more fixings in the root node. 

Finally, we compared orbitopal fixing to the isomorphism pruning approach of 
Margot. We implemented the basic variant with fixed canonical variable order, the 
ranked branching rule (see [24]), as well as the variant with free branching deci- 
sions, each adapted to the special symmetry we exploit, which simplifies Margot's 
algorithm significantly. We decided to use the canonical order variant, as it gave 
the best results. It can be seen from Table 1 that isomorphism pruning is inferior 
to both orbitopal fixing (OF winners: 30, isomorphism pruning winners: 0) and 
shifted column inequalities (30:0), but is still a big improvement over the basic 
variant (28:2). 

We did not implement orbital branching, since this method when using global 
symmetry only is very similar to isomorphism pruning in our context as pointed 
out in Section 6. Hence, we decided to restrict to an experimental comparison to 
isomorphism pruning. 

It should be noted that, in contrast to orbitopal fixing, both isomorphism pruning 
as well as orbital branching could exploit symmetries of the instance graphs, too. 
However, no nontrivial graph automorphisms in our test instances were found by 
nauty, see [28]. 
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